First Story Detection using Entities and Relations

نویسندگان

  • Nikolaos Panagiotou
  • Cem Akkaya
  • Kostas Tsioutsiouliklis
  • Vana Kalogeraki
  • Dimitrios Gunopulos
چکیده

News portals, such as Yahoo News or Google News, collect large amounts of documents from a variety of sources on a daily basis. Only a small portion of these documents can be selected and displayed on the homepage. Thus, there is a strong preference for major, recent events. In this work, we propose a scalable and accurate First Story Detection (FSD) pipeline that identifies fresh news. In comparison to other FSD systems, our method relies on relation extraction methods exploiting entities and their relations. We evaluate our pipeline using two distinct datasets from Yahoo News and Google News. Experimental results demonstrate that our method improves over the state-of-the-art systems on both datasets with constant space and time requirements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IREvent2Story: A Novel Mediation Ontology and Narrative Generation

Event detection is a key aspect of story development which is composed of multiple narrative layers. Most of the narratives are template-based and follow a narration theory. In this paper, we demonstrate a narrative from events detected in the international relations domain along with classification of events using our novel mediation ontology. We also introduce a novel method of classifying ev...

متن کامل

Story Link Detection Based on Event Words

In this paper, we propose an event words based method for story link detection. Different from previous studies, we use time and places to label nouns and named entities, the featured nouns/named entities are called event words. In our approach, a document is represented by five dimensions including nouns/named entities, time featured nouns/named entities, place featured nouns/named entities, t...

متن کامل

Rhetorical Structure Analysis of EFLs’ Written Narratives of a Picture Story

This study was set to reveal how second language learners use rhetorical relations in their written narratives in terms of Rhetorical Structure Theory (RST) primarily proposed by Mann & Thompson (1987) and developed by Mann, Matthiessen & Thompson (1992). To this end, sixty written narratives based on the picture story book ‘Frog, where are you?’ were collected from EFL learners and were put to...

متن کامل

TWO-STAGE METHOD FOR DAMAGE LOCALIZATION AND QUANTIFICATION IN HIGH-RISE SHEAR FRAMES BASED ON THE FIRST MODE SHAPE SLOPE

In this paper, a two-stage method for damage detection and estimation in tall shear frames is presented. This method is based on the first mode shape of a shear frame. We demonstrate that the first mode shape slope is very sensitive to the story stiffness. Thus, at the first stage, by using the grey system theory on the first mode shape slope, damage locations are identified in shear frames. Da...

متن کامل

Character Profiling in 19th Century Fiction

This paper describes the way in which personal relationships between main characters in 19 century Swedish prose fiction can be identified using information guided by named entities, provided by a entity recognition system adapted to the 19 century Swedish language characteristics. Interpersonal relation extraction is based on the context between two relevant, identified person entities. The re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016